Skip to content

This commit contains all significant contributions until 23/8/2016 #761

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 16 commits into
base: 4.x
Choose a base branch
from

Conversation

anguelos
Copy link
Contributor

This pullrequest changes

Implemented ImagePreprocessor a genealogy of functors for preprocessing
samples before they are fed to a CNN.

Implemented TextImageClassifer as an abstraction of any object that
provides a vector of probabilities given an image. A gluing class
providing compatibility to callbacks for OCRBeamsearchDecoder allows
to plug any such object to the existing algorithms.

Implemented DeepCNN a class that is supposed to load pre-trained Caffe
models. Although several back-ends have been planed the only operational
one is the one employing Caffe its self. GPU is fully supported but can only
be enabled and disabled globally.

Implemented a class using a DeepCNN that implements word-spotting

Implemented a python and a C++ demo demonstrating the Word-spotting
Class

Implemented a class that generates synthetic text samples at high speeds
and with a full pipeline of realistic distortions for training
Word-spotting CNN's who need millions of samples.

Implemented a python demo demonstrating the usage of the text synthesizer

Implemented ImagePreprocessor a geneology of functors for preprocessing
samples before they are fed to a CNN.

Implemented TextImageClassifer as an abstraction of any object that
provides a vector of probabillities given an image. A gluing class
providing compatibillity to callbacks for OCRBeamsearchDecoder allows
to plug any such object to the existing algorithms.

Implemented DeepCNN a class that is supposed to load pretrained caffe
models. Although several backends have been planed the only fully ope-
rational one is the one employing caffe its self. GPU is supported.

Implemented a class using a DeepCNN that implements word-spotting

Implemented a python and a C++ demo demonstraiting the WorSpotting
Class

Implemented a class that generates sythetic text samples at high speeds
and with a full pipeline of realistic distortions for training
Word-spotting CNN's who need millios of samples.

Implemented a python demo demonstraiting the usage of the text synthesizer
…y to the synthesizer with the simple highgui
…keLists.txt substituting configure_file with add_definitions.
… empty file as was originally in the master branch
@mshabunin mshabunin added the GSoC label Nov 11, 2016
Copy link
Contributor

@mshabunin mshabunin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have several more comments:

  • DeepCNN_classdiagram.pdf can be removed, because doxygen can generate inheritance diagrams
  • please resolve merge conflicts
  • can we close #723 ?

@@ -1,7 +1,13 @@
#ifndef __OPENCV_TEXT_CONFIG_HPP__
#define __OPENCV_TEXT_CONFIG_HPP__

// HAVE QT5
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually this file is not usable, because there is no corresponding configure_file command in cmake script (like here).

HAVE_TESSERACT works because the definition is being added to the compilation command (with add_definitions(-DHAVE_TESSERACT)).

Actually I think we should use the dnn module to run the network, is it possible?

@@ -61,82 +65,126 @@ enum
OCR_LEVEL_TEXTLINE
};

//base class BaseOCR declares a common API that would be used in a typical text recognition scenario
//base class BaseOCR declares a common API that would be used in a typical text
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please, restore the formatting of all documentation comments and code.

@@ -43,6 +43,7 @@
#include "precomp.hpp"
#include "opencv2/imgproc.hpp"
#include "opencv2/ml.hpp"
#include "opencv2/highgui.hpp"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it needed?

message(STATUS "TEXT CAFFE CONFLICT")
endif()

find_package(Qt5Gui)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It will be better to use HAVE_QT5 from the main repository. (examples)

@mshabunin
Copy link
Contributor

Ping @anguelos .

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants